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A CENTRAL LIMIT THEOREM VIA DIFFERENTIAL EQUATIONS 

By Taral Guldahl Seierstad^ 

University of Oslo 

In a paper from 1995, Wormald gave general criteria for certain 
parameters in a family of discrete random processes to converge to the 
solution of a system of differential equations. Based on this method, 
we show that if some further conditions are satisfied, the parameters 
converge to a multivariate normal distribution. 

1. Main theorem. In this paper, we consider parameters defined on ran- 
dom discrete processes. When the parameters change by only a smah amount 
from one state in the process to the next, one often finds that the parameters 
satisfy a law of large numbers, that is, the parameters are sharply concen- 
trated around certain values. Wormald [6] gives some general criteria which 
ensure that given parameters converge in probability to the solution of a 
system of differential equations. 

In fact, such parameters often satisfy not only a law of large numbers, 
but also a central limit theorem. Based on the differential equation method 
described in [6] and a martingale central limit theorem due to McLeish [3], 
we show that when certain general criteria are satisfied, a set of parameters 
defined on a family of discrete random processes converges to a multivariate 
normal distribution. 

As examples of processes to which this method can be applied, we consider 
in Sections 4 and 5 two random graph processes. In both processes, the 
initial state is an empty graph on n vertices, and edges are added one by 
one according to a random procedure. 

Consider a sequence {Qn,^n, Pn) of probability spaces. Let nin be a se- 
quence of numbers such that uin = 0{n) , and suppose that for each n a filtra- 
tion Tn,o C Tn,i ^ • • • ^ Tn,m^ ^ is given. Let {y^n,7n] m = 0, 1, . . . , m„} 
be a sequence of random vectors in W, for some g > 1, such that X.„ ^ is 
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measurable with respect to J-n,m for < m < m„. The kih entry in X„^m 
is denoted by Xn,m,k- For 1 < m < m„ and I < k < q, we define -^Xy^ ^ — 
^n,m - ^n,m-i and AXn,m,k = Xn,m,k - Xn,m-i,k- If V is a vector, we let 
v' be the transpose of v. If v is a column vector, we use the notation to 
mean vv'. Thus, if v is a g-dimensional vector, is a g x g-matrix. We use 
the norm ||v|| = ||v||oo- When we use the notation O(-) and o(-), we mean 
that the bounds hold as n ^ oo, unless stated otherwise; if the notation is 
used to refer to matrices or vectors, the bounds are meant to apply to every 
entry in the matrix or vector. 

If Z) C M'^, we define the stopping time Hd = HD(X.n,m) to be the mini- 
mum m such that n~^X„^m ^ D. 

The object of this paper is to find criteria which ensure that X„ con- 
verges to a multivariate normal distribution, whose mean and covariance 
matrix are continuous functions of t and can be obtained by solving cer- 
tain differential equations. The mean is obtained by applying the following 
theorem due to Wormald. 

Theorem 1 (Theorem 5.1 in [7]). Assume that there is a constant Cq 
such that Xn^m,k ^ Cqu a.s. for all n, < m < and 1 < k < q. Let 
/feiM^ ^ M, I < k < q, be functions and assume that the following three 
conditions hold, where D is some hounded connected open set containing the 
closure of 

{(zi, . ..,Zq):¥[Xnfl,k = Zk-n, l<k<q]^0 for some n}. 

(i) For some function (5 = (3{n) > 1, ||AX„^„i|| < /3, a.s. for 1 < m < 

(ii) For some function Ai = Ai(n) = o(l) and all k with 1 <k <q, 
\^[^Xn^m,k I -^m-l] — fk{n Xn ,m— 1,1 ) ■ ■ ■ ,n 

Xn ,m— l,q J I !b ^1 

for 1 <m< Hd . 

(iii) Each function f^ is continuous, and satisfies a Lipschitz condition, 
on D. 

Then the following are true. 

(a) For (zi, . . . , Zq) G D, the system of differential equations 

(1) ^ = fk{zi,...,Zq), k=l,...,q, 

has a unique solution in D for : M ^ R passing through 

Zk{0)=Zk, k = l,...,q, 
and which extends to points arbitrarily close to the boundary of D. 
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(b) Let A > Ai with A = o(l) and let r]{P,X) = f exp(-^). For a suffi- 
ciently large constant C, with probability 1 — 0{i]{l3, X)), 

Xn,m,k = nzkim/n) + 0{Xn) 

uniformly for <m < an < nin and for each k, where Zk{t) is the solution 
in (a) with % = n'^X^^.k, and a = cr{n) is the supremum of those m to 
which the solution can be extended before reaching within Loo-distance CX 
of the boundary of D . 

We can now state our main theorem, which is based on Theorem 1. The 
multivariate normal distribution with mean and covariance matrix S is 
denoted by AA(0,i;). 

Theorem 2. Assume that the conditions of Theorem 1 are satisfied, 
with f3 = o(n^/^^~^) for some e > and Ai = o(n~^/^). Furthermore, assume 
that the functions fk are differentiable, and that each partial derivative of 
fk is continuous, on D. Let zi{t), . . . , Zq{t) be the functions obtained in (b) 
of Theorem 1. Let gij :M'' — > M, 1 < i,j < q, be functions, and assume that 
the following conditions hold. 

(ii') For some function X2 = X2{n) = o(l) and all i,j with 1 <i,j <q, 

,m,l ) ■ • ■ ^n ^n,m,(j) I ^ ^2 

for 1 < m < if/) . 

(iii') Each function gij is continuous and satisfies a Lipschitz condition 
on D. 

Then there is a continuous matrix-valued function S:M— >M'?^'' such that 

^n,m-nz{m/n) d . , 

y= >7V(0,S(m/n)), 

y n 

where z{t) = [zi{t), . . . , Zq{t)]' , for < m < an. 

The proof of Theorem 2 in Section 3 also describes the procedure for 
calculating the matrix 

2. A central limit theorem for near- martingales. Our proof of Theorem 2 
will be based on a central limit theorem for multidimensional martingales. 
Let {Sn,m',^T' = 0,1,..., nin} be an array of random ^-dimensional vectors 
with S„^o = 0- We denote the A:th entry in Sn,m by Sn,m,k and let as before 
^Sn,m = Sn,m " ^n,m-i- This theorem is the multidimensional version of 
Corollary 2.6 in [3]. 
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Theorem 3. Let Sn.m clit- array as above, and let T, = {crij}ij be a 
q X q-matrix. Assume that the following conditions are satisfied. 

(i) maxm 1 1 AS 

n,m\\ has uniformly bounded second moment. 

(ii) maxm ||AS„,m|| ^ 0. 

(iii) For all l<i,j <q, Em=l ^Sn,m,i^Sn,m,j ^ CTij- 

(iv) Er=iIE[AS„,^| J-^_i]^0. 

(v) Er=iIE[AS„,^| J-^_i]2^0. 

Then S„,„„4aA(0,S). 

Proof. Corollary 2.6 of McLeish [3] asserts that the theorem is true 
when q = l and an = 1. It follows easily that the theorem also holds for 
arbitrary un in the univariate case. 

Assume that q> I, and let a = [ai, . . . , Ug]' £ be an arbitrary g-dimen- 
sional vector. Let Rn^m = J2k=i '^kSn,m,k- Since 

^Rn,m — ^ ^ Q-fc A5'ji,m,A;) 
fe=l 

it is easy to see that (i), (ii) and (iv) are satisfied for i?n,m- Assumption (v) 

means that J2m'^[^Sn,m,i \ J^m-i]E[A5„,mj | -^m-i] ^ for ah l<i,j <q. 
Hence, 

'^E[ARn,m I -^m.-l]^ = ^ OiOj ^ E[AS'„^m,i | •^m-l]E[A5„^mj | ^m.-l] 
m ij m 

tends to in probability, so (v) holds also for Rn,m- Finally, we have 

^^(A-Rri,,m) = ^ ^ I ^ ^ 0,k^Sn,m,k ) = ^ ^ ^ ] 0,iO'jASn^m,iASn^m,j 
m m \k=l / m l<i,j<q 

1<«J<'3 ™- 1<«J<? 

SO (iii) is satisfied for Rn,mn with parameter a'Sa. Hence, by the univariate 
version of the theorem, Rn,m„ AA(0, a'Sa). Since this holds for all vectors 

aGR^ it follows that Sn,m„ ^AA(0,S) (see, e.g., Definition 3.2.5 in [5]). 

□ 



3. Proof of main theorem. This section is devoted to the proof of The- 
orem 2. We are given a sequence of random g-dimensional vectors X„ ^ 
and functions fk with 1 < k < q and Qij with 1 < i,j < q such that the 
conditions of Theorem 2 are satisfied. We will generally suppress n in the 
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subscript, so we write X^, for X^^^ and so on. It follows from the assump- 
tions of Theorem 2 that we can choose a function A = o(n~^/^) such that 
A > /^n"^/^"*"^ for some e > 0. Thus, according to Theorem 1, there are func- 
tions ai{t), . . . ,aq{t) such that 

(2) X,n,k = nak{m/n)+o{n^/^) 

with probability 1 — 0{e~"'^). Let £ be the event that (2) holds for 1 < 
k < q and < m < m^. Then ¥[£] = 0{ne~'^ ). It is sufficient to prove 
that the conclusion of the theorem holds conditioned on £. Indeed, let 
a(t) = [ai(t), . . . , ag(i)]' and = n~-'^/^(Xm, — ncx{m/n)). We have for 
an arbitrary bounded continuous function 7, 

IE[7(W^)] = P[£:]E[7(W„) I £] + P[^E[7(W^) | £] 

= E[j{Wm)\£] + 0{ne-^'). 

Thus, if Wm tends to a normal distribution conditioned on £, it also tends 
to a normal distribution when not conditioned on anything. In the following, 
we therefore assume that £ holds. 

Let F : M'"' ^ M'"' be the vector- valued function whose kth component is fk', 
that is, 

7i(zi,...,Zq)" 



F{Zl,...,Zg) 



fq{zi,...,Zq)_ 

= o(n~^/^), so condition (ii) of Theo- 



By the assumption of Theorem 2, Ai 
rem 1 implies that 

(3) E[AX„ I J^m-i] = F{n~^Xm-i) + o{n-^/^). 

We write = cx[m/n). If we let t = m/n, then Taylor's theorem implies 
that 

OLm+i = cx{t + n~^) = cx{t) + + 0{n-^) 

dt 



so 
(4) 



^=^am + n-^F{am) + 0{n-^), 



analogous to (3). The Jacobian matrix of F is 

J(z) 







92:1 


dZg 


dfq 




. dzi 


dZq . 
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From calculus, we know that if a, y G W^, then 

(5) F(a + y)-F(a) = J(a)y + 0(||yf) 

as y ^ 0. We now let = — nctm be the centered version of X^- 
By (2), 

(6) Y^ = o(n3/4). 



Thus, 
(7) 



F(n-iX„) - F(a^) = F{a^ + n-^Y.^) - F{a„ 



so 



E[AY„,] = E[AX^] - nE[Aam] 

^^i^ F(n-iX„_i) - F(a„„i) + o(n-V2) 

J(a,„_i)n-iY,„_i + o(n-V2) (£) 

Thus, E[AY.m] tends to 0; however, the bound we have obtained is not 
strong enough to apply Theorem 3 directly to Y^- We will instead consider 
a transformation = TmYm; where is a q x q-matrix chosen so that 
E[AZm] = o(n~^/2) and n"^ J2m Zm,iZm,j ^ djit) for some functions ^jj(t). 
Then we will apply Theorem 3 to the array ti ' ? showing that it con- 
verges to a multivariate normal distribution. The normality of X^ will then 
be inferred from the normality of Z^. 

For ease of notation, we write A{C) = J(a(t)). Note that Aif^ is a con- 
tinuous matrix-valued function. Next, we define T(t) to be the q x g-matrix 
satisfying the differential equation 

(8) ^T{t) = -T{t)A{t\ T{0)=I. 



Lemma 1. There is an open interval (ti,t2) containing [0,cr] such that 
there is a unique solution to the differential equation (8) on (ti,t2); which 
furthermore satisfies a Lipschitz condition on (ti,t2)- If T{t) satisfies (8), 
then T{t) is invertihle for all t E (ti,t2)- Furthermore, let Tm = T{m/n), 
Am = A{m/n) and 

(9) U^ = I-n-^Am. 
Then 

(10) Tm+l=TmU,n + 0{n-^) 

for <m< an — 1 . 
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Proof. Let A{t) = {aij{t)}ij. If Ti{t) = [rii(t), • • .,Tiq{t)] is the ith row 
of T{t), then it is a solution of the system of hnear homogenous differential 
equations 

d 

(11) 'dt'^^^^^ ^ ~ ^ akj{t)Tij{t), Tij{0) = Sij, 

k=l 

which can also be written Ti{t) = —Ti{t)A{t), rj(0) = e^. Thus, every Ti{t) 
is actually a solution to the same system of linear differential equations; only 
the boundary condition is different. 

Let t[ = inf{t : a(i') G L> for t < t' < 0} and t'2 = sup{t : a(i') E D for < 
t' <t}, and choose ti,t2 such that t'l < ti < < a < t2 < By assumption, 
A{t) is continuous on (^1,^2). Hence, according to Theorem 12, Chapter 2 
of Hurewicz [1], there is a unique solution to (11) on {t[,t'2). Moreover, 
by Theorem 2, Chapter 3 of [1], the solutions Ti{t), . . . ,Tq{t) are linearly 
independent for all t £ (^1,^2) il they are linearly independent for some t G 
{t'l, t'2) ■ Thus, since r(0) = / is invertible, T(t) is invertible for all t G (^'1,^2)- 

Since A{t) and T{t) are continuous on (i'l, t2)) tbey are bounded on (ii, t2)- 
Thus, by (8), ^T{t) is bounded on (ti,t2)) and so T{t) satisfies a Lipschitz 
condition on (ii,t2)- 

Finally we obtain by Taylor's theorem that 

Tm+i = T{t + = T{t) + n-^jT{t) + 0(n"2) 

Tm - n-^T^Am + 0(n-2) ^2) T^Um + 0(n-2). □ 

The matrices A[t) and T[t) do not depend on n, so we have A[t),T{t) = 
0(1). As indicated, we now define = TmYm- The next two lemmas show 
that Zm has the properties required in order to apply Theorem 3 to the 
array n~"^/^Zm- 



Lemma 2. For all m, 

(12) AZ„, = 0(/?) a.s., 
and 

(13) E[AZ^\J^m-i] = o{n-^/^). 



Proof. We have 

(14) ^ = ^ {Tm-lUm-l + 0{n~^))Yr,^ - Tm-lY m-l 

T'm-i(f/m_iYm - Ym-i) + o(n"^). 
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By (4), nAam = 0{l), so 

||AZ„|| = 0{\\Um~iYm - Y^„i||) ^2) 0(||AY„||) + 0(n"i P„_iY„||) 
< ||AX„|| + ||nAa„|| + o{n-^/^) = 0(/3) + 0(1), 

implying (12). Then we consider (13), and first show that the conditional 
expectation of the term inside the parentheses in (14) is small. We have 

— lYjTi Y}72— 1 ^ m— ll 

= Um-i{E[X,n I J^m-l] " ««,„) - Xm_i + nOCm-l 

= (/ - n"^Am_i)(Xm_i + F(ri"^Xm_i) - na^-i - F{am-i)) 

- Xm_i + nocm-i + o(n~^/^) 
= -n-U^„i(X„„i - nQ„_i) + F(n^iX„_i) - F(q„_i) + o{n"^/'^) 
(7) 

W -n-^A^_iY^_i + n'^Am^iYm-i + 0(^-^/2) 
= o(n-i/^), 

where we for the second equality have used (3), (4) and (9). Thus, 

E[AZ^ I ^ = ^ r„,_iE[C/™_iY„ - Y„_i I J^m-i] + o(n-i) 

= r^_io(n-i/2) + o(n-i)=o(n-i/2). □ 

We now turn to the quadratic variation. 

Lemma 3. For all m, 

(15) (AZ^)2 = 0(/32) a.s. 

Moreover, for 1 < i,j < q, there is a function : M — > M, such that 

m 

(16) ^ AZk,iAZk,j ^ iij{m/n). 

k=l 

Proof. We have by (14) that 

(17) (AZ,„)2 = T„„i ([/„_! Y„ - Y^^ifT'^^^ + 0(1), 
and by (9) that 

{Um-lYm - Ym-lf = {AY mf + o{l). 
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Since nAcxm = 0(1), it follows from condition (i) of Theorem 1 that 

{AYmf = (AX„ - nAa„)(AX;, - nAa'J 
(18) = (AX^)2 - AX^nAa'^ - nAa^AX'^ + n^Aocmf 

= 0{(3^). 

This implies (15). To show (16), we take the conditional expectation and 
get 

E[(AY„)2 I ^„,_i] ^ = ^ E[(AX„)2 I ^„„i] - E[AX„ | .F,„_i]?iAa'^ 

- nAa^E[AX;, | T^^i] + n^Aa^ 

E[(AX„)2 I - F(7i-iX„_i)F(«„,_i)' 

- F(«™_i)F(n"iX^_i)' + F(q„_i)2 + o(l) 

E[(AX„)2 I Tm^^] - F(a„,_i)2 + o(l). 

Thus, by (17), 

E[(Az^)2 1 = r„_iE[([/„Y„ - Y„_i)2 1 .^„_i]r;^„i + o(l) 

(19) 

= r^_i(E[(Ax„)2 1 - F(a(t))2)r;;_, + o(i). 

Let G : ^ M^^'' be the matrix-valued function such that 

G(Z1, ...,Zq) = {gij{zi, Zq)}ij. 

Condition (ii') of Theorem 2 can then be expressed as 

(20) E[(AX^)2 I ^„,„i] = G{n-^Xm,i, . • • , n-^X^,,) + o(l). 

For l<i,i <q, let Cm.ij = EfcLi A^fc.jAZfcj, and let = {Cm.ij jjj = 
X;™=i(AZfe)2. Using (19) and (20), we find that if t = m/n, then 

- I -^m-l] = E[(AZ^)2 | 

(21) 

= r(t)(G(7i-ix„_i) - F(«(t))2)r(t)' + 0(1). 

For 1 <i,j < q, let /ijj :M'^+-'^ — > M be the functions such that 

(22) r(i)(G(^i, . ..,zg)- F(a(t))2)r(t)' = {hij{t, zi,..., Zg)}ij. 
Then it follows from (21) that for 1 <i,j < q, 

(23) \E[ACm.,i,j I -^m-i] - hij{m/n,Xm.,i/n,. . .,Xm,g/n)\ < A3, 

for some function A3 = A3(n) = o(l). 

Let Vm be a random variable such that Vm = m a.s. We will now apply 
Theorem 1 to the random variables in the set {Vm} U {Xm,k}k U {Cm,i,j}i,j- 
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Since AVm = 1, the conditions of Theorem 1 are clearly satisfied by Vm- 
Moreover, we already know by assumption that they are satisfied by X^^k 
and fk- Thus, we only have to check that they are also satisfied by Cm,i,j 
and hij. 

By (15), |ACm,jj| < ll(^Zm)^|| =0(/3^), so condition (i) is satisfied. Con- 
dition (ii) is satisfied because of (23). To see that condition (iii) is satisfied, 
we have to show that the functions hij are continuous and satisfy a Lipschitz 
condition on some area in M'^"'"^. 

Let ti and t2 be as in Lemma 1. Let 

D' = {(t,^i, . . . ,Zg):ti <t <t2, {zi, ...,Zg)£ D}. 

Let us consider F, G and T as functions from ]R^+^ to M, such that if t G M 
and z G W, then F(i,z) = F(z), G(i,z) = G(z) and T{t,z) = T{t). Since 
D' is bounded, the product of two Lipschitz continuous functions on D' is 
itself Lipschitz continuous on D' . By Lemma 1, T{t,z) satisfies a Lipschitz 
condition, and by the assumptions, F(t,z) and G(t,z) do so as well. It 
then follows from the definition of hij in (22) that hij satisfies a Lipschitz 
condition on D' . 
Let 

C^J{t)= J hj{t)dt, Cii(0)=0. 

Since 0^ = o(n-^/^), we can choose a function A' = o(l) such that A' > A3 and 
r/(/3^. A') = 0(1). Then Theorem 1 implies that 

Cm.,i,j = nS,ij{m/n) + O(A'n), 

for < m < an and 1 < i,j < q, with probability 1 — o(l). Hence, (16) is 
proved. □ 

Lemma 4. Let = n~^/^Zm and let E{t) = {S,ij{t)}i,j ■ Then 

Mm^MiO,E{m/n)). 

Proof. We will show that satisfies the conditions of Theorem 3. 

(i) By (15) in Lemma 3, 

||(AM,)'|| = n-i(AZfe)2|| = OiP^n) = o(l), 

so maxfc ||AMfc|| has uniformly bounded second moment. 

(ii) By (12) in Lemma 2, 

max||AMfc|| = n"^/^ max || AZfc|| = 0{P/ ^/n) = o{l). 
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(iii) By (16) in Lemma 3, 

m m 
k=l k=l 

(iv) By (13) in Lemma 2, 

m m 

k=l k=l 

= n-^/^m-o{n-^/^)=o{l). 

(v) Again by Lemma 2, 

m m 



l2 



fc=l k=l 

= n~^m ■ o{n~^) = o(l). 
The conclusion then follows from Theorem 3. □ 

Proof of Theorem 2. From Lemma 1, we know that T{t) is invert- 
ible, so we can define S(t) = T(t)~^H(t)(T(i)~^)'. We then conclude from 
Lemma 4 that 

^Ar(0,S(m/n)). ^ 



n 

4. Random graph processes with restricted degrees. For a positive in- 
teger d, the random d-process is a random graph process defined as follows. 
Begin with an empty graph on n vertices. Every step in the process con- 
sists of choosing two distinct vertices in the graph uniformly at random, and 
adding an edge between them if and only if the vertices are not adjacent and 
both of them have degree at most d — 1. The process ends when the graph 
no longer contains a pair of nonadjacent vertices, both of which have degree 
smaller than d. It was proved in [4] that the graph process asymptotically 
almost surely (i.e., with probability tending to 1 as n ^ oo, abbreviated 
a.a.s.) produces a graph where at most one vertex has degree d—1 while all 
other vertices have degree d. If dn is even, the final graph is a.a.s. d-regular. 

This process was used in [6] to illustrate the usage of the differential 
equation method. Here, we show that the present central limit theorem also 
can be applied to the process. Let Gm be the graph after m edges have been 
added, and let Vm^k be the random variable denoting the number of vertices 
of degree k in Gm- We follow the argument in [6] and note that 
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Moreover, |A14n^fc| < 2 always, and the domain D is chosen as —e < Zi < 
1 + £ for < k < d and e < za < 1 — £ for some e > 0. All the conditions 
of Theorem 1 are therefore satisfied, and it follows that there are functions 
7o(t), . . . ,7(i(i) such that a.a.s. 

ym,k = lk{m/n)n + o(n) 

for < < d. In order to apply Theorem 2, we note that 



d-ld-l 



SO condition (ii') of Theorem 2 holds for the functions 

d-ld-l 

Qijizi, . . . ,Zq) = ^Y^ (I- z )2 ('^'^'^-l ~ - 



We choose D to be the same as earlier, and note that the functions gij satisfy 
a Lipschitz condition on D. Theorem 2 then implies the following theorem. 

Theorem 4. Let V„, = [Vm,o, V„^^d\' and ■y{t) = [7o(t), • ■ • ,7d(*)]'- 
Let 5 > and let be the smallest value for which ^din^s/^) > 1 — 5- 
There is a continuous matrix-valued function such that 

Vm- n-y{m/n) d 



n 



■M{0,T.{m/n)) 



for <m< ms . 



5. The minimum degree random graph process. Our second application 
is the first phase of the minimum-degree graph process, first introduced 
in [7]. One complication in this case is that the graph process has a nat- 
ural random stopping time, and we will show that the random variables 
under consideration also have a jointly normal distribution at the end of the 
process. 

For a fixed n, the minimum degree graph process is a sequence of graphs 
{G™™}m,>o which is constructed as follows. The initial graph G™™ is an 
empty graph on n vertices. For m > 1, let Vm be a vertex chosen uniformly at 
random from the vertices of minimum degree in G™™]^, and let Wm be chosen 
uniformly at random from the vertices distinct from Vm- The graph G™™ 
is obtained from G™™^ by adding to it the edge {vm,Wm)- For simplicity, 
we will allow multi-edges; however, in the stages of the process we consider, 
there will a.a.s. be so few multi-edges that they make no significant difference 
to the calculations. 
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Let H be such that G^^™ does not contain isolated vertices, while G^™^ 
contains at least one isolated vertex. In [7], it was proved that a.a.s. H = 
hn + o(n), where /i = ln2. In this paper, we will consider the graph only up 
to the point H; that is, we add edges at random until there are no isolated 
vertices left, and then we stop. Thus, H \s a. stopping time of the process, 
and we consider the process G™^^. It is easy to see that no cycle can be 
formed before time so G™" is a forest. In [7], it was furthermore proved 
that the number of vertices of any degree is sharply concentrated around 
the expectation. Instead of the vertex degrees, we will consider the order of 
the components in G™™, and in particular in G^™. For A: > 1 and m > 0, let 
Cm,k be the random variable denoting the number of components in G,™™ of 
order k, and let Ck = CH,k- Let Cm = [Cm,i, Cm,q]' and C = [Gi, . . . , Cg]' , 
where > 1 is some fixed natural number. In [2], it was shown that Cm,k = 
f3k{m / n)n + o{ri) a.a.s., where 

P^{t) = \{l-e-'f-\[k + l)e-' -I), 
k 

Let /3(t) = . . . We will prove the following theorem. 

Theorem 5. There is a continuous matrix-valued function such 
that 

(24) ^--"f("^/"U AA(0,£Wn)) 

forO<^<h. 

k2 



Let fi = {Tw}k=i- Then there is matrix S such that 



(25) ^^4aA(0,S). 

\ n 



Proof. Assume first that — = t < h, where t is a constant. When a new 
edge {vm,Wm) is added, Vm is by definition an isolated vertex, while Wm can 
have any degree, and be in a component of any order. Let Vm be the random 
variable denoting the order of the component containing Wm- Then 

(26) ^Cm,k = —5kl — ^k,Vm + ^k-l,Vm- 

The probability of choosing a vertex in a component of order k is 

n — 1 

so the expected change in the number of components of order k is 
E[ACm,k I ^,n-i] = fk{n-'Cm,i, n-^Cm,q) + o{n-^'^), 
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where 

(27) fk{zi,.. .,Zq) = -6ki - kzk + {k- l)zk-i. 
Furthermore, it is clear that C^^k ^ i^ Sind (26) imphes that 

(28) AC^^k < 2 

for m> 1 and 1 < k < q. The set D can be chosen as e < zi < 1 + e and 
— e < Zfc < 1 for 2 <k <q for any e > 0. Then satisfy a Lipschitz condition 
on D. We obtain a system of differential equations of the form (1), and it 
can be shown that it has the solution Zk = f3k{t), satisfying the boundary 
conditions /3fc(0) = (5^1 • Let to = h — 6. For every 5 > 0, we can choose e so 
small that the solution does not leave D until t > to- It follows that a.a.s. 

Cm,k = Pk{t)n + o{n) 

for 1 <k <q and <t < h, with t fixed. This was already shown in [2]. 

In order to apply Theorem 2, we need an expression for the conditional 
expectation of ACm,iACm,j ■ This is 

i]=9ij{n ^Cm,i,...,ra ^Cm,g) +o(l), 

where 

(29) gij{zi, ...,Zq) = ^ kzk{-5ii - 5ki + Sk,i-i){-Sji - 6jk + Skj-i). 

k>l 

We let D be as earlier. Then the conditions of Theorem 2 are satisfied, and 
we conclude that there is a matrix E(t) such that (24) holds. 

We then turn to (25). Let mo = L^o^J • By (28), \CH,k - Cmo,k\ < 25n. By 
letting 5 go to 0, we can conclude that 

n~^CH,k ^ Pkih) = -g^- 

Unfortunately, we cannot obtain the matrix S simply by setting t = h va. 
S(t), since the stopping time H \s a random variable. Instead, we define new 
random variables . . . , ^ which behave like Cm,k up to m = H , but 

which we can analyze even after H. Let for m > 1 be defined as follows. 

If Cm_i,i > 0, let = Vm- Otherwise, let V^ = k with probability 

for 2 < < g and q + 1 with probability 1 - J2k=2 n''' ■ ^'o.fc = "^^ki and 
define 

Cm,fc = C^-i,k - ^ki - h,v^ + h-iy^- 
Clearly, ^ = Cm,k for 0<m< H. We observe that 

E[AC°k I -^m-i] = fk{n-'C°„ n-'C°) + o{n-'/^) 
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and 

E[AC;^,,AC;^,^. I Tm-x\ = mjin-'C°^,i, n-^C°^^^) + o(l) 

when m <hn + o{n), where fk and gij are defined by (27) and (29), respec- 
tively. Let D° C M'? be the defined by -e < < 1 + e for 1 < A; < g. Then 
the requirements of Theorem 2 are satisfied, and we can conclude that 



n 



AA(0,5](/i)). 



However, we are interested in the distribution of Ch = C^, so we have to 
find the difference between and C^^. For 1 < k < q, let be ran- 
dom variables such that [Wf , . . . , W°]' ~ AA(0, S(/i)). Thus, for example, 

n~-'^/^C^„ ^ Wi- Let r/„ = Then writing m' = m — hn, 

m — hn . 
r]n > ^— <^H>m<:^C' i>0 



m 



2g-m/n _ ^ ^-m' /n _ ^ 



n J ^/n ^/n 

f m\ m — hn 



^W^i(^)> (1 + 0(1))- 



n 



so r]n W°. 

When m = hn + o{n), F[V^ = k] = ^ + o(l) for 1 < /c < Let 

HVhn 

Bk = sgn{H - hn) ^ 5y„,fc- 

HAhn 

Thus, \Bk\ is the number of times a vertex in a component of order k is chosen 
between H and hn. Then E[Bk] = {H - hn){^ + o(l)) and one easily sees 
that E[\Bk\i\Bk\ - 1)] = (1 + o(l))E[|Sfc|]2, so by Chebyshev's inequality, 
Bk ~ (-ff — hn)^^. Hence, n~^/'^Bk ^^"Hn, and we can conclude that 

Ck-n{k-l)/{k2^d^^^^ 



n 

where VFfc = + VFf for 1 < < Since {Wi, . . . ,Wq} are hnear com- 
binations of {Wi, . . . , VFJ}, they are jointly normal random variables. □ 

When the functions fk are linear, as in this section, it becomes easier to 
calculate S(t) explicitly, than in the nonlinear case. The matrix A is then 
a constant matrix and the solution of (8) is T{t) = e~*^, where the matrix 

exponential is defined as e*^ = J2i>o 
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In our example, the Jacobian matrix is J = {j6ij+i — jSij}ij, and A = J, 

so 
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The covariance matrix is then given by 




which can be used as a general formula for the linear case. 
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